Compressed pattern matching について

Words near each other

・ Compressed air car
・ Compressed air energy storage
・ Compressed air filters
・ Compressed air foam system
・ Compressed air gramophone
・ Compressed audio optical disc
・ Compressed data structure
・ Compressed file library
・ Compressed fluid
・ Compressed Gas Association
・ Compressed Hare
・ Compressed hydrogen
・ Compressed hydrogen tube trailer
・ Compressed magnetic flux generator
・ Compressed natural gas
・ Compressed pattern matching
・ Compressed sensing
・ Compressed sensing in speech signals
・ Compressed suffix array
・ Compressed-air vehicle
・ Compressibility
・ Compressibility equation
・ Compressibility factor
・ Compressible duct flow
・ Compressible flow
・ Compression
・ Compression (album)
・ Compression (astronomy)
・ Compression (functional analysis)
・ Compression (geology)

Dictionary Lists

mini英和辞書

翻訳と辞書　辞書検索 [ 開発暫定版 ]

スポンサードリンク

Compressed pattern matching ：ウィキペディア英語版

Compressed pattern matching

In computer science compressed pattern matching or CPM is the process of searching for patterns in compressed data with little or no decompression. Searching in a compressed string is faster than searching an uncompressed string and requires less space.
==Compressed matching problem==
If the compressed file uses a variable width encoding it could be present a problem: for example, let “100” be the codeword for ''a'' and let “110100” be the codeword for ''b''. If we are looking for an occurrence of ''a'' in the text we could obtain as result also an occurrence that is within the codeword of ''b'': we call this event ''false match''. So we have to verify if the occurrence detected is effectively aligned on a codeword boundary. However we could always decode the entire text and then apply a classic string matching algorithm, but this usually requires more space and time and often is not possible, for example if the compressed file is hosted online. This problem of verifying the match returned by the compressed pattern matching algorithm is a true or a false match together with the impossibility of decoding an entire text is called the compressed matching problem.
Many strategies exist for finding the boundaries of codewords and avoiding full decompression of the text, for example:
*List of the indices of first bit of each codeword, where we can apply a binary search;
*List of the indices of first bit of each codeword with differential coding, so we can take less space within the file;
*Mask of bit, where bit 1 marks the starting bit of each codeword;
*Subdivision in blocks, for a partial and aimed decompression.

抄文引用元・出典: フリー百科事典『ウィキペディア（Wikipedia）』
■ウィキペディアで「Compressed pattern matching」の詳細全文を読む

スポンサードリンク

翻訳と辞書 : 翻訳のためのインターネットリソース